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Abstract. Let pi < p2 < ■ ■ ■ < Pv < ■ ■ ■ be the sequence of prime numbers and 
let m be a positive integer. We give a strong asymptotic formula for the distribution 
of the set of integers having prime factorizations of the form Pm''iPm''2 ■ ■ ■Pm*'n with 
fci < ^2 < • ■ ■ < kn- Such integers originate in various combinatorial counting 
problems; when m = 2, they arise as Matula numbers of certain rooted trees. 



1. Introduction 

Let {pv}'^^i be the sequence of all prime numbers arranged in increasing order and 
let m > 1 be a fixed positive integer. We shall consider the class of integers only 
admitting prime factors from the subsequence {Pm^lfcLo' ^^^^ 

(1.1) Am = ■ ■ ■ Pm^r, G N : ^1 < ^2 " " " < ^n} • 

The aim of this article is to provide an asymptotic formula for the distribution of Am-, 
namely, the following counting function 

n<.x 

The function M2^2 arises in various interesting combinatorial counting problems; 
particularly, in connection with rooted trees. In 1968 Matula gave an enumeration of 
(non-planar) rooted trees by prime factorization [20j, the so-called Matula numbers. 
Number theoretic aspects of this rooted tree coding have been investigated in detail in 
P, [13] . Such numbers may be used to deduce many intrinsic properties of rooted trees 
[H [121 [15]. The set A2 in fact corresponds to a class of Matula numbers. In Section [2] we 
review Matula coding of rooted trees and give the interpretation of M2,2 as the counting 
function of rooted trees with height less or equal to 2, under Matula's enumeration. It 
is worth mentioning that the significance of Matula numbers comes from applications 
in organic chemistry, as they can be employed to develop efficient nomenclatures for 
representing molecules of a variety of organic compounds (cf. [51 [HI [3 [SI [II])- As 
explained in Section [21 M2^2 might also be regarded as a "transfinite counting function" 
for the ordinal u'^ in a certain complexity norm ^25j . 

In [25] Weiermann found the weak asymptotics of the counting function M2,2- Using 
a Tauberian theorem by Kohlbecker for partitions [18], he showed that 
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[1.2) logM2,2(x) -vr. 



' 2 log X 
3 log 2 



The asymptotic relation fll.2p resembles the one obtained by Hardy and Ramanujan 
in 1917 for the celebrated (unrestricted) partition function, 

f2n 

(1.3) logp(n) ~ Y ' 

which they (16j, and independently Uspensky j24], greatly refined later to 

(1.4) p{n) 



I 2n 
3 



(4v^) 



n 



Naturally, the transition from (11. 3p to (11.41) consists in finding missing asymptotic 
terms. The problem we address here is of similar nature. We shall fill the gap be- 
tween (II. 2p and the strong asymptotics by exhibiting hidden lower order terms in the 
approximation (II. 2p . as stated in the following theorem. 

Theorem 1. The function M2^rn has asymptotic behavior 



e^WSlogm '°Hvife; / /21ogx log2(loga;) 

(1.5) M2,m(x) ~ — — — — (logx) 2iogm exp vr/ 



2tt'^ log 2 V V 3 log m 8 log m 

where 

Km = 777^ — f (loglogm)^ + 7^ - 27 log log m - ^ - log^ f — =^=)) - C2,m , 
2 log m \ 6 \ V 6 log m / / 

7 is the Euler-Mascheroni constant, and C2,m is given by the convergent series 

ri A 1 17 11 l°g^ log log m\ 

C2,m = y A loglogp^fc - logfc - loglogm - — . 

\ kiogm klogm J 

We will provide a proof of Theorem [T] in Section |H The proof is based on Ingham's 
method from [17j; however, it turns out that Ingham's original theorem for partitions 
[T71 Thm. 2] is not directly applicable to our context. In Section El we shall slightly 
extend his result. It is likely that such an extension of Ingham's theorem might be 
useful for treating partition problems other than the one dealt with in this article. 



2. Two COUNTING PROBLEMS AND M2,m 

2.1. Rooted trees. Matula's coding of (non-planar) rooted trees in terms of prime 
factorizations provides a bijection between such trees and the positive integers. The 
same rooted tree enumeration was rediscovered by Gobel in [TT]. It is defined as follows. 
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If we denote the trivial one-vertex tree by •, then its Matula number is n(«) := 1. 
Inductively, if Ti, T2, . . . , Ti are trees and T is given as 

T2 Q---r)Ti_, 



then its Matula number is defined as n(T) := Pn{Ti) ■ ■ -Pn{Ti)- 

If Ti^k is the tree of height one with k nodes above the root, then n{Ti^k) = Pi = 2^^. 
If T has height two, then 

where the j-th node connected to the root carries a tree Ti^kj- 

It is then clear that Matula coding gives a bijection between the set of rooted trees 
with height equal to 1 or 2 and the set A2 defined in (II .ip . Consequently, M2^2{x) 
counts the number of Matula numbers corresponding to trees with < height (T) < 2, 
that are below x, i.e., 

1 =M2,2(X) + 1. 

n{T)<x 
height (T)< 2 

Thus, this rooted tree counting function has also asymptotics (11. 5p . 

2.2. Ordinal counting functions. It might seem surprising at first sight that the 
counting function M2,2 is related to studying asymptotic properties of transfinite ordi- 
nals. Since transfinite ordinals rarely show up in a number-theoretic context we will 
explain some features of this connection in informal and general terms. The rest of 
the paper will not depend on the exposition given in this subsection, but it might be 
useful as a source of inspiration for further study. 

In naive set theory ordinals generalize the ordering of the natural numbers < 
1 < 2 < . . . by continuing beyond the first limit point u like < 1 < 2 < . . . < 
u) + l<u) + 2<.... This process can be continued beyond the next limit u + u like 
0<l<2<...<a; + l<a; + 2<...<c<;-2 + l<w-2 + 2<... and by iteration like 
0<l<2<...<u + l<u + 2<...<u-2 + l<u-2<...<u-3<...<u-n.... 
At a certain moment we reach the first limit of limits u ■ u and, again by iteration, we 
reach limits of limits of limits and in the limit of this counting we reach u'^ (an ordinal 
which - as will become clear soon - is of relevance to M2^m)- 

The ordinal uj'^ is not at all frightening since it appears as the order type of the 
polynomials in lN[x] under eventual domination or as the order type of the multisets of 
natural numbers. There is of course no bound in counting through the ordinals and by 
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further counting we reach oj^ ,uj^ , . . ., but the higher we go the more comphcated 
the description becomes. Shght extensions could still be dealt with by combinatorial 
means (which can still be formalized in Peano arithmetic) and stronger extensions will 
require from some moment onwards basic set theoretic machinery. 

There is still some nice and accessible visualization of the ordinals less than Eq, which 
is the limit of the finite powers of u showing up in the sequence u ^ uj'^ , u^'^ , u^'^ , . . .. 
For this we consider a subclass of Hardy's orders of infinity. Let £ be the class of unary 
functions / : IN — )■ IN such that 

(1) The function cq is an element of £ where cq{x) = x and 

(2) with two functions f,g'm£ also the function h is in £, where h{x) = x^^^^ +g{x). 

On £ we define the ordering of eventual domination as usual hj f -< g if and only if 
there exists a non- negative integer k such that f{x) < g{x) for all x > k. The structure 
{£, -<) is isomorphic with {{a : a < Eq}, <) and so we can identify both structures. 
If we also write id = Cq for the identity function on IN, then the isomorphism maps 
CO to id, uj^ to id**^, uj'^" to irf*'^* , etc. The ordinal e^ is the proof-theoretic ordinal of 
first order Peano arithmetic PA. PA proves (after an appropriate formalization of the 
context) the scheme of transfinite induction for all strict initial segments of Eq but not 
the scheme of transfinite induction for the full segment up to Eq. 

For treating £ in the context of arithmetic we need a specific (easily definable) coding 
of the elements of £ into the natural numbers. One of the standard devices for achieving 
this is provided by associating to the elements of £ their canonical counterparts in the 
finite non-planar rooted trees. Such a bijection t can defined recursively as follows. 
First, t(co) := •. If / = id^^ + ■■■ + id^" then let 



By this identification we can - using Matula's coding - canonically associate to f G £ 
its Godel number [/] := n{t{f)). This coding has been used explicitly by Troelstra and 
Schwichtenberg in [231 P- 320, Def. 10.1.5]. In this context we arrive at the following 
interpretation 



For coding a larger segment of ordinals Schiitte [22, Sec. V.8] used a related coding 
Nr which when restricted to £ has the property that M2a{x) is the number of ordinals 
a below u'^ such that Nr{a) < x. 

Until now, the study of ordinal counting functions has found applications to logical 
limit laws for ordinals and to phase transitions for Godel incompleteness results (it 
seems very interesting and intriguing to find additional applications). A further dis- 
cussion of phase transitions will be beyond the scope of this exposition, but we want 
to include an intriguing example for a zero one law (see pj for an account on logical 
limit laws). As usual, we use |= for the satisfaction relation from model theory. Let cp 




M2,2{x) = i^{fe£:f^id''A\f] <x}. 
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be a sentence in the language of linear orders. Let 

_ ^{feS■.f^^d^''^\f^<x^{{ge8■.g<f},^) |=y} 
i^{f e£:f -<^d^'^^\f^<x} 

Then 5,^ exists and either = 1 or 5^ = 0. A proof of this and similar results has been 
obtained in [26] by an analysis of the asymptotic behavior of M2,2 and related counting 
functions. 

At the beginning of this subsection it has been indicated that ordinals might provide 
a source of inspiration for further research and we will now indicate some possible 
options. For g e £, let Cg{x) := #{/ E £ : f g A \f] < x}. For various choices 
of g some preliminary results on weak asymptotics for Cg have been obtained in [25j. 
Moreover strong asymptotics for c be obtained by elementary means. 

We believe that the methods of this paper will allow one to provide strong asymp- 
totics for c-^idk for any given fixed k. A strong asymptotic formula for c-^nid (which 
would resemble something like multiplicative double partitions) seems however to re- 
quire new methods. A general challenge would be then to provide a general theorem 
on strong asymptotics for Cg for any fixed g and for analogous functions emerging from 
the Schiitte coding. 

3. An EXTENSION OF INGHAM'S THEOREM FOR UNRESTRICTED PARTITIONS 

As mentioned at the Introduction, we need an extension of Ingham's theorem for 
strong asymptotics of partition functions. The extension will follow from a complex 
Tauberian theorem for large behavior of the Laplace transform, also due to Ingham 
[13 Thm. 1']. 

Let < Ao < Ai < ■ ■ ■ < Afc — )■ oo be a sequence of real numbers and let 

N{u) = J2 1 

\k<u 

be its counting function. Consider the additive semigroup A generated by {A^j^Q, i.e., 

I 

A = {r eR:r = Y^ n^Afc, nfc G N} . 

For r G A, the partition function p{r) is defined as the number of ways of writing r as 
r = "^k^onkXh- We further set 

r<u 
reA 

The following theorem obtains the asymptotic behavior of P{u) if one knows a certain 
average asymptotic behavior for N{u). It slightly extends that of Ingham by allowing 
an extra term of the form B log^ u in the asymptotic expansion fl3.1l) . As usual, ( stands 
for the Riemann zeta function and F for the Euler Gamma function. The constant 71 
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denotes the Stieltjes constant, that is, 



7i = hm > 

n. — i-rxn f ^ 



" log k loK^ n 



n—>-oo '—^ k 
k=l 



Theorem 2. Suppose that 

(3.1) [ ^^dt = -u'' + B\og^u + C\ogu + D + o{l), 

Jo t « 

with a, A > 0. Then 



(3.2) P{u) ~ ' e^'M-(^+i)M^-''^-t exp + B log^ 



M 



where 



/3 = -, M = {AaT{a + l)C{a + l))—^, D' = D + { — - - -f^ ) B. 

a + 1 \ 6 / 

In order to deduce Theorem [2] from Ingham's Tauberian theorem, we proceed to find 
the asymptotic behavior of the Laplace-Stieltjes transform of P. Set 



F(s) = ^e""^^- = / e-'^'dPiu 
Jo 



k=0 

and 



The generating function identity F(s) = e^^^^ is well-known. 

Lemma 1. // (13. ip holds, then, as a — )■ 0^, 

p N{u) ^ Ar{a + l)C{a + 1) ^ ^, ^, 

cr / du = ^ ^^^^ + B log^ 0- - C log a + + o(l) . 

/n e'^" — 1 cr" 



Here 

(3.3) i;' = Z)+('!^_27i-f )^ 



Proof. We employ standard Schwartz distribution calculus in our manipulations. It 
may also be possible to give a classical proof along the lines of that of [191 Thm. 
IV.23.1]. For Schwartz distributions, we follow the notation exactly as in [9], Chap. 2]. 
Taking distributional derivative in (13. ip . we obtain 

-AX-ul-^ + (B log^ A + C log A + D)Siu) + 2B Pf ^^^'^ " 



u \ u 

+ (251ogA + C)Pf (^^^^ +o(l), A^oo, 

distributionally in the space of tempered distributions S' (cf. [Q"^ Sec. 3.9], [STj Sec. 
2.5]), where 6 stands for the Dirac delta distribution, H is the Heaviside function, 
and Pf denotes regularization via Hadamard finite part [9, Sec. 2.4]. Testing this 
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asymptotic expansion at the test function = M/(e" — 1), setting a = 1/A, and 
taking into account the well-known formula 

(3.4) r(s + l)C(s + 1) = / du, 3fJes>0, 

Jo e" - 1 

we obtain 



Jo ^ ~ ^ ^ 



where D' = D + 2BK' + CK, and the constants K and K' are given by the Hadamard 
finite part at of the integrals 

/<- = F.p,r^ and A-F.p.r^.„. 
io - 1 Jo - 1 

Hence, it remains to evaluate these two constants. We will do so by inspecting the 
Laurent expansion at s = of the analytic continuation of fl3.4l) . In fact, the clas- 
sical procedure of Marcel Riesz P [10] yields the analytic continuation of (13. 4p to 
C \ {0, — 1, —2, . . . } as the finite part integral of the right hand side. By employing the 
Gelfand-Shilov Laurent expansion at s = for the distribution (TUl p. 87], we 
conclude 

du , < I s I < 1 



r W , 1 >r^s"^ /-"^log^M 

Jo - 1 s f-^ n! Jo - 1 



n=0 

On the other hand, since 

1 /^^ TT^' 

C(s + 1) = - + 7 - 7is + . . . and r(s + 1) = 1 - 7s + + — ) + 



12 



we have 



1 /tt^ 

r(s + l)C(s + 1) = - + ( - 71 - y ) s + • • • , < |s| < 1 , 

and therefore K = and K' = - 71 - 7V2. □ 

Proof of Theorem [H We can now apply Ingham's Tauberian theorem [T7| Thm. 1'] to 
the generating function F{s) with 

^{s) ^^ -4r(a + l)C(a + l) ^(,):=e^'.-^e^'-^^ 
Indeed, in view of Lemma [H 

F(a) ~ x(^)e'^^"\ a^0+, 

and the quoted theorem of Ingham immediately implies that 

Xjcrju)) exp (<^(a(n)) + ua{u)) 

r[u) ~ , , u — > 00 , 

v/27r(T2(M)^"(a(M)) 
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where (j{u) is the inverse function of —f', i.e., <j{u) = Mu~~ with M = {AaT{a + 
l)C(a + 1))^. Setting (3 = a/{a + 1), we have 

^ip{<T{u))+ucr{u) ^''^'^ 



a{u)^2T:^"{a{u)) \ 2n J 

whence (13.21) follows. □ 

In the sequel, we will only use the case a = 1 of Theorem [2l which we state in the 
next corollary for the sake of convenience. 



Corollary 1. Suppose that 

N{t) 



t 



dt = Au + Blog'^u + Chgu + D + o(l) 



Then 

P{u) ) ^^-^-''^"^^^^^exp I vr^/^ + ^log^^ 



3 « V V e; / c_i_R, / / 2Au B 



2v^ V 

where D' is given by ( IJ. 



4. The original problem 



We now proceed to give a proof of Theorem [H We translate our original problem 
into an additive partition problem. Consider = logp^*- Then, with the notation of 
the preceding section, 

M2,Uen = 5^p(r) = P{u) . 



r<u 
■re A 



Of course, here p{r) = 1 for each r G A = |X]fc=o ^°S^'m'= • ^fe ^ n|. 

Thus, we are interested in the average asymptotics of the counting function 

N{u)= Yl 1- 



<U 

tlL-" 

Observe that 

(4.1) [ ^dt = N{u)logu- V loglogp^fe 
Jo 1 ^ 

We first need to estimate loglogp^fc- 
Lemma 2. 

tAo\ 11 17,11 , , loglogm flog^k 

(4.2) log logp^fc = log + log log m + — h — — h O 



klogm klogm \ k"^ 
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Proof. Using the prime number theorem, Cipolla [3] found in 1902 an asymptotic for- 
mula for Pn. Employing just two terms in the expansion, we have 

Pn = n log n + 0{n log log n) . 

This leads to 

\ogpn = log n + log log n + O 

and 



log log n 
logn 



loglogn / (log log 2 
log log Pn = log log n H h C 



log n \ log n 

Thus, for n = m'', we obtain the required formula. □ 

Lemma [2] immediately yields: 

Corollary 2. 

rAi\ n \/i n ^ 1 ; 1 1 ^^S^ loglogm 

(4.3) C2,m = y A log(logPmO - log fc - log log m - 



fc=i ^ 



k log m k log m 



converges. 

Next, 
Lemma 3. 

r , M log^M 1 , 

4.4 / ^dt = - logM + D„ + ol , 

Jo ^ logm 21ogm 2 



(loglogm)^ /I /logm\ 7i loglogm 

= + log -\\ — - C2,yn " :j \ 7 • 

2 log m \ log 2 V 27r / log m log m 

Proof. We first notice that 

log(7r(e")) 

p^k < if and only li k <y := — , 

logm 

where 7r(x) is the distribution of the prime numbers. By the prime number theorem, 

U logM , ^ A 

y = + - 

log m log m \u 
Thus N{u) = \_y\ +1 and, combining equations (14. ip . (14. 2 p and (14. Sp . 



-^dt= {[y\ + 1) logM - loglog2 - ^log(logp^ft) 

fc=i 



( [y\ + 1) log w - log log 2 - C2,m - log( [yj !) - (log log m) [y\ 

1 ^\ogk loglogm ^1 
logm ^-^ k logm ^-^ k 

° A;=l ^ k=l 
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Using Stirling's formula and the defining formulas for 7 and 71 

log(n!) = nlogn — n + - logra + log(A/27r) + o(l) , 

7 = -logn + o(l) and 7i = "^f -^|— + ^(l) , 

k=l k=l 

we have 

''^W^. I 11 f ^ A . 1 ^1 I iQg' L^J /^l loglogm^, 

— — rft = [y\ log + logM + [y\ - — + — log [y\ 

t \ [yj log 171/ 2 log m \2 log m 

^ ^ o n 1 ^ ^1 loglogm 

- log log 2 - Ca,^ - log(V27r) - 7 + o(l) . 

log m log m 



Since 



Li/J log rnr:— = ly\ log i + , ,m_^ = — r— — + o 



ly\ log m J \ ly\ log m J log m V 



log [y\ = \og{y + 0(1)) = logy + O ( - J = logw — loglogm + 



and 



we obtain fl4.4p . as required. □ 

The asjTTiptotic formula (11.51) follows by combining Lemma [3] and Corollary [1] after 
a straightforward calculation. The proof of Theorem [T] is complete. 
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